Slope at Zero Crossings (zc) of Speech Signal for Multi-speaker Activity Detection
نویسندگان
چکیده
Multi-Speaker activity (MSA) detection helps in detecting the presence of whether the speech signals has a single speaker or multiple speaker speeches in the speech signal. It is easy to calculate the slope at ZCs (zero crossings) of the speech signal and makes a comparison with a suitable threshold (Th). Multi-speaker is declared as and when the zero crossing value exceeds the threshold. The impact of the proposed technique is compared to the existing technique by calculating the sample-by-sample ZCR (Zero crossing rate) value is demonstrated. Experimental results prove that the proposed ZCR technique achieves accurate results than the traditional techniques for MSA detection that uses the cepstrum resynthesis residual magnitude (CRRM) in the literature.
منابع مشابه
Automatic Segmentation of the Clean Speech Signal
Speech Segmentation is the measure of the change point detection for partitioning an input speech signal into regions each of which accords to only one speaker. In this paper, we apply two features based on multi-scale product (MP) of the clean speech, namely the spectral centroid of MP, and the zero crossings rate of MP. We focus on multi-scale product analysis as an important tool for segment...
متن کاملZero-crossing analysis technique in speech and speaker recognition
The resul ts of previous studies showed a possibili ty of application of zero-crossing rates as parameters for speech and speaker recognition The paper presents a method for obtaining these parameters and the resul ts of experiments carried out in order to estimate the absolute and relative discriminating power of zero-crossing parameters in speech and speaker recogni tion. As a basic test mate...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملA Microprocessor based Speech Recognizer for Isolated Hindi Digits
A novel method for recognition of isolated spoken words on an 8-bit microprocessor is presented. The method uses a new but simple feature vector based on the zero-crossings of the speech signal. The feature vector is the histogram of the time-interval between successive zero-crossings of the speech signal. Dynamic time warping is used to calculate a time-aligned normalized distance between the ...
متن کاملSpeaker independent voiced-unvoiced detection evaluated in different speaking styles
We propose a new algorithm for voiced/unvoiced classification of speech on a phoneme or sample level. The algorithm is inspired by auditory based approaches and combines two cues. One cue is based on the energy distribution of the signal and the other on the harmonicity. In order to extract the harmonicity of the signal we calculate a histogram of the zero crossings of the filter channels after...
متن کامل